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ON A DUALITY BETWEEN METRICS AND E-PROXIMITIES 1 
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In studies of discrete structures, functions are frequently used that express the proximity of objects but 
^ ■ do not belong to the family of metrics. We consider a class of such functions that is characterized by 

| a normalization condition and an inequality that plays the same role as the triangle inequality does 

for metrics. We show that the introduced functions, named Yi-proximities ( "sigma-proximities"), are 
in a definite sense dual to metrics: there exists a natural one-to-one correspondence between metrics 
and E -proximities defined on the same finite set; in contrast to metrics, E- proximities measure 
comparative proximity; the closer the objects, the greater the Y,-proximity; diagonal entries of the 
^-proximity matrix characterize the centrality of objects. The results are extended to the case of 
arbitrary infinite sets of objects. 



■3 

A metric on a set A is a function d : A 2 — > R such that for any x,y,z G A, 

(1) d(x, y) = if and only if x = y; 

(2) d(x, y) + d(x, z) — d(y, z) > (triangle inequality). 

> 

' It follows from this definition that for any x,y G A, 

d(x,y) = d(y,x) (symmetry); 
d{x,y) > (nonnegativity). 

: 

Functions that express proximity are not necessarily metrics. Let us consider another class of functions, 

■ whose representatives are frequently encountered and implicitly used in both applied and theoretical studies, for 
\ instance, in analyses of linear statistical models, Markov processes, electrical circuits and economic models, and also 
' in graph theory and network theory [1-9]. 

Definition 1. Suppose that A is a nonempty finite set and E is a real number. A function o : A 2 — > R will 

■ be referred to as a H-proximity {read as "sigma-proximity") on A, if for any x,y, z G A, the following statements are 
. . true: 

(1) normalization condition: ^2 o~{x,t) = S: 

rS , (2) triangle inequality: o~(x, y) + o(x, z) — o~(y, z) < o-(x, x), and if z — y and x =^ y, then the inequality is 

strict. 

The reason why this inequality is referred to as a property of metrics that has a different form will be clear 
from what follows. By virtue of the normalization condition, every matrix that represents a S-proximity has an 
eigenvector of all ones, £ being the corresponding eigenvalue. When considering E-proximities, we will assume that 
the set A and the number E are fixed, unless otherwise specified. 

Proposition 1. Let a be a Yj-proximity on A. Then for any x,y G A, 

<r(x,y) = cr(y,x) (symmetry); 
if x ^ y, then a(x,x) > o(x,y) (egocentrism) . 

The proofs are given in the Appendix. 

The aim of this note is to discover a relationship between E-proximities and metrics. 
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Suppose that d is a metric on a finite set A and \A\ — n. Introduce the following notation: 
d(x,-) = 

n ; 

teA 

<*(■>■) = rf ( s ^)- 

Proposition 2. For any metric d on a Unite set A with \ A\ = n, the function 

a(x, y) = d(x, ■) + d{y, •) - d(x, y) - d(-, •) + - (1) 

n 

is a H-proximity on A. 

The function a constructed from d with the transformation (1) essentially expresses the proximity between 
x and y as compared with the average proximity of x and y to all elements of A. 
Proposition 3. For any T,-proximity a on A, the function 

d(x, y) = ^(<t(x, x) + a(y, y)) - a(x, y) (2) 
is a metric on A. 

Note that for the determination of d(x,y) with (2), it is sufficient to know the values of a on three argu- 
ments: (x,x), (y,y), and (x,y). In this sense, the transformation (2) is local and resembles taking finite differences. 
Conversely, (1) is a discrete integral transformation. A noteworthy feature of S-proximities derived from metrics is 
that they provide relative averaged indices of proximity. In particular, by (1), <j(x,x) — 2d(x, •) — d(-, ■) + i.e., 
a(x,x) is greater for those x that have greater average distance from all elements of A. Thus, a(x,x) measures the 
"provinciality" of x in A: "central" elements have smaller values of a(x, x) than "peripheral" ones. The same is 
suggested from the normalization condition: o~(x,x) = S — ^2 o~(x,y), and the smaller the o~(x,y) (which express 

the proximity of x to the other elements of A), the greater the a(x, x). 

Let ip{d) and ?/>(ct) denote the mappings defined by (1) and (2), respectively. 

LEMMA 1. ip((p(d)) is the identity transformation of the set of metrics dehncd on A. ip(tp(aj) is the 
identity transformation of the set of T,-proximities defined on A. 

According to Lemma 1 and Propositions 2 and 3, the mappings f(d) and ip{°~) defined on the set of metrics 
on A and the set of S-proximities on A, respectively, are mutually inverse. This implies the following theorem. 

THEOREM 1. The mappings ip(d) and ip(o-) = Lp^ 1 (a) determine a one-to-one correspondence of the set 
of metrics on A and the set of T,-proximitics on A. 

O-proximities occupy a central place among S-proximities; other S-proximities can be obtained from them 
by a translation of S/n (see the normalization condition or (1)). The set of O-proximities on A as well as the set 
of metrics on A is closed with respect to addition and multiplication by positive numbers. Note in this connection 
that for S = 0, <p and ip are linear mappings of the corresponding sets. The sets of S-proximities with other 
values of S are closed with respect to convex combinations. One more important class of S-proximities is that of 
1-proximities with nonnegative values. These functions can be represented by symmetric doubly stochastic matrices 
and frequently occur in various applied investigations. It is also worth mentioning S-proximities derived from metrics 
with S = nd(-, ■). Here, the mean proximity equals the mean distance, and the right-hand side of (1) reduces to the 
first three terms; moreover, Va; e A, a(x,x) = 2d(x, •). 

The concept of S-proximity can be extended to infinite sets. A way to do so suggested by the normalization 
condition is to replace summation with integration in this condition. However, if the measure of A is infinite, this 
replacement gives rise to a set of functions that has a structure differing from that in the finite case. In particular, S- 
proximitics with E / cannot be obtained from O-proximities by the addition of a constant function. A generalization 
that preserves the properties observed in the finite case can be constructed by the replacement of summation with 
the operation of averaging. Here there is no need to restrict oneself to an explicit form of the average. Instead, we 
shall consider abstract averaging functionals and require of them only those properties that are necessary for the 
proofs of our statements. 

Suppose that A is a nonempty set, and A 1 and A2 are some sets of functions A — > R and A 2 — > R, respectively. 
Definition 2. A real-valued functional fj, defined on a subset B 1 C A 1 will be referred to as a linear averaging 
functional if \i and B 1 have the following properties. 
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(1) B 1 is a linear space over R containing all constant functions; 

(2) n is a linear functional over B l taking each constant function to its value; 

(3) if f,gEB 1 and \/x G A f(x) > g(x), then /i(f) > /j,(g) (monotonicity) . 

Note that by the Riesz theorem (see, e.g., [10]), under some conditions, among which the most important 
one is continuity, every linear functional is representable as the Stieltjes integral of its argument with respect to some 
charge. 

Let n be a linear averaging functional defined on B x . Suppose that / G A 2 , and for any x E A, f(x ,y) 
belongs to B l as a function of y. Denote by f(x, •) = n y (f(x, y)) the function of x that takes each x to the result of 
the application of fi to f(x, y) as to a function of y. 

Definition 3. We say that a set B 2 Q A 2 is a family of averagable functions of two variables on A if 
— B 2 is a linear space over R that contains all constant functions and all elements of B 1 as functions of each of its 
arguments constant in the other argument, and 
-for any f E B 2 , 

(1) Vx e A g x (y) E B x , where g x (y) = f(x,y); 

(2) f(x, ■) E B 1 and g[x) E B x , where g(x) = f(x,x). 

Now the notion of S-proximity can be generalized as follows. Let A be a nonempty set and suppose that 
B 1 , ji, and B 2 are as defined above; m is a real number. 

Definition 1'. A function o E B 2 will be called a Y* m -proximity on A if for any x,y,z E A, the following 
statements hold: 

(1) normalization condition: cr(x, •) = m, and 

(2) triangle inequality (the same as in Definition 1): cr(x, y) + a(x, z) — a(y, z) < a(x, x), and if z — y and 
x 7^ y, then the inequality is strict. 

The following primed statements are similar to those formulated above. The plans of the proofs remain 
the same, but wherever the properties of the arithmetic mean and the normalization condition in the summation 
form were used, now the properties of a linear averaging functional [i and sets B 1 and B 2 are applied. In particular, 
precisely due to the requirements imposed on B 2 , this set contains the images of the mappings ip and ip. 

Proposition 1'. For any T, m -proximity a E B 2 and for any x,y E A, 

u(x,y) =a(y,x) (symmetry); 

cr(x,x) >cr(x,y) (egocentrism) . 

The reason that the above inequality weakens here is that in the case of infinite A it is natural to require the 
monotonicity rather than the strict monotonicity of ^ (cf. Definition 2 and the proof of Proposition 1). Note that 
the monotonicity of \i is not used in the proofs of the subsequent statements (with the exception of Corollary 1). 
Due to the symmetry of S m -proximities and metrics, we need not require the commutativity of [i x and fi y applied 
to functions from B 2 . This way, the symmetry ensures that the notation d(-, •) = fifi y (d(x,y)) — fifi x (d(x,y)) in the 
following statement is well defined of (not to be confused with /j,(d(x , x))) . 

Proposition 2'. For any metric d E B 2 , the function 



is a metric on A and belongs to B 2 . 

Let f(d) and ip(cr) be the mappings defined by (3) and (4), respectively. 

LEMMA 1'. The mappings ip((p(d)) and ip(tp(cr)) are the identity transformations of the set of metrics 
that belong to B 2 and the set of T, m -proximities on A, respectively. 

THEOREM 1'. The mappings <p(d) and ip(o-) = (p^ 1 (a) determine a one-to-one correspondence of the set 
of metrics that belong to B 2 and the set of Y, m -proximitics on A. 

Note, in conclusion, that turning to S m -proximities can be of help in proving some statements about average 
distances. For example, the fact stated below immediately follows from Theorem 1' and Proposition 1'. 

COROLLARY 1. For any set A, metric d E B 2 , and x E A, 



a(x, y) = d(x, •) + d(y, •) - d(x, y) - d(-, •) + m 
is a T, m -proximity on A. 

Proposition 3'. For any T, m -proximity a, the function 



(3) 



d(x, y) = -(cr(x, x) + a(y, y)) - a(x, y) 



(4) 



d(x, •) > 



d(;-) 



(5) 



2 
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Observe, for completeness, that this inequality cannot be refined by replacing 1/2 with a greater factor. 
Indeed, it suffices to consider an infinite A (or a sequence of finite A with an increasing number of elements) and the 
"almost discrete" metric with one "central" element x such that ^x l ,x 2 G A \ {x }, d(x 1 ,x 2 ) = 1, d(x 1 ,x a ) = 1/2. 
If A is finite, it follows from Theorem 1 and Proposition 1 that the inequality (5) takes a strict form. By considering 
the set A that consists of a line segment and a remote point, it is easy to show that the ratio d(x, -)/d{-, •) (here 
d(-, •) > is assumed) is not bounded above. 

APPENDIX 

Proof of Proposition 1. Symmetry is shown by putting z = x in the triangle inequality and using the 
arbitrariness of x and y. To prove egocentrism, consider again the triangle inequality, now assuming x ^ y: for any 
z e A, we have 

a(x, y) + <t(x, z) - a(y, z) < a(x, x). 

Summing these inequalities over all z G A and taking into account that at z — y the inequality is strict, we have 

zeA zeA 

where n = \A\, and by the normalization condition, <r(x,x) > a(x,y). □ 

Proof of Proposition 2. The normalization condition is verified straightforwardly. To prove the triangle 
inequality, note that the substitution (1) yields 

a(x, x) + a(y, z) — a(x, y) — a(x, z) = d(x, y) + d(x, z) — d(y, z), (6) 

and the nonstrict part of the triangle inequality for a follows from the inequality of the same name for d. At z = y 
and x y, the right-hand side of (6) becomes 2d(x,y) — d(y,y), and the strict statement of the triangle inequality 
for g follows from the first axiom of metrics and the nonnegativity of d(x, y). □ 

Proof of Proposition 3. By virtue of (2), d(x,x) = for any x G A, and according to the strict part 
of the triangle inequality for a, d(x, y) > when x ^ y. To prove the triangle inequality for d, it suffices to make 
substitution (2) resulting in the familiar equality (6), 

d(x, y) + d(x, z) — d(y, z) — a(x, x) + <j(y, z) — a(x, y) — a(x, z), 

and to use the nonstrict part of the triangle inequality for a. □ 

Proof of Lemma 1. The first statement is verified by substituting (1) in (2) and making use of the first 
axiom of metrics; the second one is verified by substituting (2) in (1) and using the normalization condition. □ 

Proof of Corollary 1. Consider the S -proximity a to which the operator <p takes d. By (3), for any 
x G A, we have 

a(x,x) = 2d(x,-) - d(-,-). (7) 

If a(x,x) < 0, then egocentrism implies <r(x,y) < a(x,x) < Vy G A, and the normalization condition is broken by 
virtue of Definition 2. Hence, a(x,x) > 0, and combining it with (7) yields the required inequality. □ 
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